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Abstract 

Networked control strategies based on limited information about the plant model usually 
results in worse closed-loop performance than optimal centralized control with full plant model 
information. Recently, this fact has been established by utilizing the concept of competitive 
bJQ[ ratio, which is defined as the worst case ratio of the cost of a control design with limited model 

information to the cost of the optimal control design with full model information. In this paper, 
we show that with an adaptive networked controller with limited plant model information, it is 
indeed possible to achieve a competitive ratio equal to one. We show that an adaptive controller 
introduced by Campi and Kumar asymptotically achieves closed-loop performance equal to the 
optimal centralized controller with full model information. The plant model considered in the 
paper belongs to a compact set of stochastic linear time-invariant systems and the closed loop 
performance measure is the ergodic mean of a quadratic function of the state and control input. 
We illustrate the applicability of the results numerically on a vehicle platooning problem. 

1 Introduction 

Networked control systems are often complex large-scale engineered systems, such as power grids [T], 
smart infrastructures [2] , intelligent transportation systems [3H5] , or future aerospace systems [3[7] . 
These systems consists of several subsystems each one often having many unknown parameters. It is 
costly, or even unrealistic, to accurately identify all these plant model parameters offline. This fact 
motivates us to focus on optimal control design under structured parameter uncertainty and limited 
plant model information constraints. 

There are some recent studies in optimal control design with limited plant model information [5]- 
Q3] . The problem was initially addressed in [5] for designing static centralized controllers for a class 
of discrete-time linear time-invariant systems composed of scalar subsystem, where control strategies 
with various degrees of model information were compared using competitive ratio; i.e., the worst 
case ratio of the cost of a control design with limited model information scaled by the cost of the 
optimal control design with full model information. The result was generalized to static decentralized 
controller for a class of systems composed of fully-actuated subsystems of arbitrary order in [51 HP]. 
More recently, the problem of designing optimal H2 dynamic controllers using limited plant model 
information was considered in It was shown that, when relying on local model information, 

the smallest competitive ratio achievable for any control design strategy for distributed linear time- 
invariant controllers is strictly greater than one; specifically, equal to square root of two when the 
_B-matrix was assumed to be the identity matrix. 

In this paper, we generalize the set of applicable controllers to include adaptive controllers. We 
use the ergodic mean of a quadratic function of the state and control as a performance measure of the 
closed-loop system. Choosing this closed-loop performance measure allows us to use certain adaptive 
algorithms available in the literature |15H18) . In particular, we consider an adaptive controller 
proposed by Campi and Kumar 15. We prove that the smallest competitive ratio that a control 
design strategy using adaptive controllers can achieve is equal one. This shows that, although the 
design of each subcontroller is only relying on local model information, the closed-loop performance 
can still be as good as the optimal control design strategy with full model information. 
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The rest of the paper is organized as follows. In Section [2J we present the mathematical problem 
formulation. In Section[3j we introduce the Campi-Kumar adaptive controller using only local model 
information and we show that it achieves a competitive ration equal to one. We use this adaptive 
algorithm on a vehicle platooning problem to demonstrate its performance numerically in Section U 
and we conclude the paper in Section [5] 

1.1 Notation 

The sets of natural and real numbers arc denoted by N and M, respectively We define N = NU{0}. 
Additionally, all other sets are denoted by calligraphic letters such as V and A. For any given sets 
X and y, the notation AA(X,y) denotes the set of all mappings from the set X to the set y. In 
addition, for any k £ No, we define Vk(X) as the set of all subsets of X containing exactly k elements. 

Matrices are denoted by capital roman letters such as A. The entry in the i th row and the j th 
column of matrix A is ay. Ay denotes a submatrix of matrix A, the dimension and the position of 
which will be defined in the text. 

A > (>)0 means symmetric matrix A £ R nxn is positive definite (positive semidefinite) and 
A > (>)B means A — B > (>)0. Let 5" + (<S") be the set of symmetric positive definite (positive 
semidefinite) matrices in R nx ". 

Let matrices A £ R" x ™, B £ R" xm , Q £ S™, and R £ be given such that the pair (A, B) 
is controllable and the pair {A, Q 1 ^ 2 ) is observable. We define P(A, B, Q, R) as the unique positive 
definite solution of the discrete algebraic Riccati equation 

P = A T PA - A T PB (B T PB + R) 1 B T PA + Q. 

In addition, we define 

L(A, B, Q,R) = - (B T P(A, B, Q, R)B + B T P(A, B, Q, R)A. 

When matrices Q and R are not relevant or can be deduced from the text, we use P(A, B) and 
L(A, B) instead of P(A, B, Q, R) and L(A, B, Q, R), respectively. 

A measurable function / : Z — > R is said to be essentially bounded if there exists a constant 
c£l such that \f(z)\ < c almost everywhere. The greatest lower bound of these constants is called 
the essential supremum of f(z), which is denoted by ess sup z62 f( z )- 

All graphs G considered in this paper are directed with vertex set {1, N} for a given N € N. 
The adjacency matrix S £ {0, lJ NxN of G is a matrix whose entry = 1 if (j, i) £ E and s,j = 0, 
otherwise, for all 1 < i, j < N. 

Let mappings /, g : Z — >• R be given. Denote f(k) = 0(g(k)) if limsupj.^^ \f(k)/g(k)\ < oo. 
Similarly, f(k) = o(g{k)) if limsup^^ \f{k)/g{k)\ = 0. 

Finally, \ denotes the characteristic function, that is, it returns a value equal one if its statement 
is satisfied and a value equal zero otherwise. 

2 Problem Formulation 

2.1 Plant Model 

Consider a discrete-time linear time-invariant dynamical system composed of N subsystems, such 
that the state-space representation of subsystems i, 1 < i < N, is given by 

JV 

Xi(k + 1) = AijXj{k) + B H Ui(k) + w^k); Xl (0) = 0, 

where Xi(k) £ M. ni , Ui(k) £ R" li , and Wi(k) £ W H are state, control input, and exogenous input 
vectors, respectively. We assume that {wi(k)}^ are independent and identically distributed Gaus- 
sian random variables with zero means K{wi(k)} = and unit covariances K{wi(k)wi(k) T } = I. In 
addition, let Wi(k) and Wj(k) be statistically independent for all 1 < i ^ j < N. We introduce the 
augmented system as 

x(k + 1) = Ax(k) + Bu(k) + w{k); x(0) = 0, 



2 



where the augmented state, control input, and exogenous input vectors are 

x(k) T = [ Xl (k) T . . . x N (k) T ] T G K™, 
u(k) T = [ Ul (k) T . . . u N (k) T ] T G K m , 
w(k) T = [ Wl (k) T . . . w N (k) T ] T G M™, 

with n = X^i^i n « an( i TO = Eti m «- I n addition, the augmented model matrices are 



and 



B = diag(Sn, 
An •• 



.1? 



AW 



A 



Nl 



N 



i-NN 



eBc 



e Ac 



Let a directed plant graph G-p with its associated adjacency matrix S-p be given. The plant graph 
G-p captures the interconnection structure of the plants, that is, Aij 7^ only if (s-p)ij 7^ 0. Hence, 
the set A is structured by the plant graph: 



Ac {Ae 



(s-p)ij = 



A^ = 0e 



^ for all i,j such that 1 < i,j < N}. 



Note that the set A x B is isomorph to the set of all plants of interest V, say. Hence, from now on, we 
present a plant with its pair of corresponding model matrices as P = (A, B) and denote V = A x B. 
We make the following assumption on the set of all plants: 

Assumption 2.1 The set A x B is a compact set (with nonzero Lebesgue measure) and the pair 
(A, B) is controllable for all (A, B) G A~x B (except for possibly a set with zero Lebesgue measure). 

2.2 Adaptive Controller 

We consider infinite-dimensional nonlinear time-invariant controllers K with control law 

U (fc)=K(J- fe ), VfceNo, 

where Tk = cr({x(t)}^ =0 U {^(i)}^ 1 ) is the sigma algebra generated by the observation history. 
Hence, each controller is a mapping K : £ n > m — > M™ ] where 



£ n > m =\J (J U <r{xuu). 

k=o xev k (R« ) ueVk- 1 (« m ) 

The set of all admissible controllers JC can be captured as the set of mappings from S n,m to R m , or, 
cquivalcntly, JC = M(£ n ' m ,R m ). 

2.3 Control Design Strategy 

A control design strategy T is a mapping from the set of plants V = A x B to the set of admissible 
controllers JC. We can partition T using the control input size as 



1 N 

where, for each 1 < i < N, we have Ti : A x B — > M(£ n ' m ,R" li ). Let a directed design graph Gc 
with its associated adjacency matrix Sc be given. We say that the control design strategy L satisfies 
the limited model information constraint enforced by the design graph Gc if, for all 1 < i < N, Ti 
is only a function of 

{[A,-! ... A^Bjj I ( 8C )^0}. 

The set of all control design strategies that obey the structure given by the design graph Gc is 
denoted by C. 
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2.4 Performance Metric 



In this paper, we are interested in minimizing the performance criterion 

T-l 

J P (K) = limsup - V x(k) T Qx(k) + u(k) T Ru(k), (1) 

where Q £ <S>™ and i? € We make the following assumption concerning the performance 

criterion: 

Assumption 2.2 For all A £ A, the pair (A, Q 1 ^ 2 ) is observable (except for possibly a set with zero 
Lebesgue measure). 

Note that for linear controllers the performance measure (JTJ) represents the i?2-norm of the closed- 
loop system from exogenous input w{k) to output 

y (k) = Q 1 ' 2 x{k)+R 1 ' 2 u{k). 

Definition 2.1 Let a plant graph Gp and a design graph Gc be given. Assume that, for every plant 
P £ V , there exists an optimal controller K*(P) £ JC such that 

J P (K*(P)) < J P (K), VKe/C. 

The average competitive ratio of a control design method T is defined as 

where f : V — > M is a positive-definite continuous function which shows the relative importance of 
plants in V. Without loss of generality, we assume that L, f(P)dP = 1. The supremum competitive 
ratio of a control design method T is defined as 



The mapping K* is not required to lie in the set C, and is obtained by searching over the set of 
centralized controllers. Hence, K*(P) = L(A,B), for all plants P = (A, B) £ V . 

The supremum competitive ratio r™ p is a modified version of the competitive ratio considered 
in [SHTi]. Note that using essential supremum in ([3]), we are neglecting a subset of plants with zero 
Lebesgue measure. However, this is not crucial for practical purposes, because the probability of 
these plants appearing in a real situation is slim. As a starting point, let us prove a very interesting 
property relating the average and supremum competitive ratios. 

Lemma 2.1 For any control design strategy F £ C, we have 1 < r^ e (T) < r™ p (F). 

Proof: See Appendix fff] ■ 

In this paper, we are interested in solving the optimization problem 

arg min r-p (F) , (4) 
rec 

where r-p is either r|, ve or r™ p . This problem was studied in [TT] when the set of plants is fully- 
actuated discrete-time linear time-invariant systems and the set of admissible controllers is finite- 
dimensional discrete-time linear dynamic time-invariant systems. It was shown that a modified 
deadbeat control strategy (which constructs static controllers) is a minimizer of the competitive 
ratio. Specifically, it was proved that the smallest competitive ratio that a control design strategy 
which gives decentralized linear time-invariant controllers can achieve is strictly greater than one 
when relying on local model information. Note that since the optimal control design with full model 
information is unique (due to Assumption 12 . 2 [ ) . even when considering a compact set of plants, the 
competitive ratio is strictly larger than one for limited model information control design strategies. 
In this paper, we generalize the formulation of |11] to include adaptive controllers. We prove in next 
section that we can achieve a competitive ratio equal to one for adaptive controllers. Therefore, we 
can achieve the optimal performance asymptotically, even if the complete model of the system is not 
known in advance when designing the subcontrollers. 
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Algorithm 1 Control design strategy T*(P). 



Parameter: {/j,(k)}^L such that lim^-joo fJ,(k) = oo but n(k) = o(log(fc)). 
Initialize (^(0), B (i) (0)) for all i £ {1, ... , AT}, 
for fc = 1, 2, . . . do 
for i = 1,2,..., N do 
if k = 2, 4, . . . then 

Update subsystem i estimate as 

= argmin W(A,B,T k ), 

(A,B)£AxB 

subject to Ay = Aij,Bu = B U ,M j,l E {1, . . .,iV}, (s c )« / 0, 
A* a = 0, V z, q € {1, . . . , N}, (s v ) zq = 0, 

where 

k 

W(i, B, J- fc ) = tr(P(i, B)) + IW*) ~ Mt - 1) - B«(t - 

t=i 

else 

(,4 (l) (fc),B (l) (fc)) <- (A«(fe-l),B«(fe-l)). 
end if 

Ui(fc) 4- TYRT (i) (A;)a;(fc). 
end for 
end for 



3 Main Results 



We introduce a specific control design strategy T*, and subsequently, prove that T* is a minimizer 



of both the average and supremum competitive ratios r^ e and 



sup 



For each plant P 6 V, this 



control design strategy constructs an adaptive controller T*(P) using a modified version of the 
Campi-Kumar adaptive algorithm [T3], see Algorithm [1] Note that in the Campi-Kumar adaptive 
algorithm, a central controller estimates the model of the system and control the system. However, 
in our modified Campi-Kumar adaptive algorithm in Algorithm [TJ each subcontrollcr estimates the 
model of the system independently and control its corresponding subsystem separately. Hence, each 
adaptive subcontroller arrives at different model estimates. 

In Algorithm [TJ we use the notation (A^(k), B^(k)), at each time step k e No, to denote 
subsystem i's estimate of the global system model P — (A, B). Furthermore, for each 1 < i < N, we 
use the mapping T; : R mxn -> W 



defined as 



N 



X 



Nl 



X 



NN 



[ x a 



X, 



N 



where G 



% i for each 1 < t,j < N. Let us also, for all k £ No, introduce the notation 

K(k) = : 

_ T N K( N \k) 

where matrices K^ l \k) are defined in Algorithm [TJ For each 5 > 0, we introduce 

m(A,B) := {(A, B) eAxB \ \\[A + BL(A, B)] - [A + BL(A, B)}\\ > S}. 

Let us start by presenting a result on the convergence of the global plant model estimates to the 
correct value. 
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Lemma 3.1 Let K = T*(P) be defined in Algorithm^ for a given plant P = {A, B) € V . There 
exists a set Af C V with zero Lebesgue measure such that, if P (£ Af, then 

lim P(A w (fc) B {i \k)) <P(A,B). (5) 

fc— > oo 

and 

fc 

J2x((A^(k),B^(k)) e ^(4,5)) =S O(mW), (6) 



t=o 

fc 



^ X (ll^ (l) (fc)-L(Ai?)|| > P ) = 0^(k)), (7) 
t=o 
fc 

$>(||A(fc) ~ HAB)\\ > p) U 0(m(A)), (8) 



t=0 

a s 



/or <5 > 0, where x — y and x < y mean ¥{x — y} — 1 and ¥{x < y} = 1, respectively. In 
addition, we get 

1 T_1 

limsup - V ||x(fc)|| p + ||u(fc)|| p < oo, Vp > 1. (9) 
T — T k~o 

Proof: See Appendix [Bj ■ 

Note that, according to Lemma r3.il we know that there exists a set Af C V with zero Lebesgue 
measure such that, if P ^ A/", the estimates in the modified Campi-Kumar adaptive algorithm 
(Algorithm [lj converge to the correct global plant model. This fact is a direct consequence of the 
use of regularized maximum likelihood estimators in the Campi-Kumar algorithm |19) . We need the 
following lemma. 

Lemma 3.2 For any matrices X,P,Y E R" xn , we have 

||A T PA - Y T PY\\ < \\P\\\\X - F||(||X|| + ||Y||). 

Proof: See Appendix [C] ■ 
Now, we are ready to present the main result of this section. 

Theorem 3.3 Let K = T*(P) be defined in Algorithm^ for a given plant P — (A,B) € V . There 
exists a set Af C V with zero Lebesgue measure such that, if P (£ Af, then 

Jp(T*(P)) a dj P (K*(P)). 

Proof: The proof follows the same reasoning as in [15] . According to [20,, p. 158], for all 1 < i < N, 
we get the set of equations in 

tr{P(A« (fc), fiW (fc))} + i(fc) T P(A (l) (fc), B w (fc))x(fc) 

= x{k) T Qx{k) +u«(fc) T i?u«(fc) + E{(A^ (k)x(k) + 5«(fc)u«(fc) + w(k)) T 

x P(A®(k),B®(k))(A®(k)x(k) + B®(k)u®(k) +w(k)) | Jfc_i} 
= x(k) T Qx(k) + «W (fc) T i? u W (fc) + E{x(k + 1) T P(AW (fc), B« (jfc)) s (fc + 1) | ^-i} 

+ (AW (fc)a;(fc) + B« (fc)«( l »(fc)) T P(A (l) (fc), £ w (fc))(A« (fc)x(fc) + B« (&)u« (fc)) 

- (Ax(k) + Bu(k)) T P{A^ (k), B^(k))(Ax{k) + Bu(k)), 

(10) 

with u^ l \k) = K^ l \k)x{k) and u(fc) = K(k)x(k). Averaging both sides of (|T0l) over time and all 
subsystems, we get 

T-1 N , T-1 N 



Ci( T ) + ^ E E^(fc) T p(^ ) (fc),B«( fc ) )a; (fc) = A. ^ £„W(fc)Tfl U « (A; ) 

fc=0 »=1 fc=0 i=l 

1 T-1 1 T-1 AT 

+ 7p E *(*) t <m*o + ^ E E E M fc + i) T p(A (2) (fc)^ w (fc))^(fc + 1) i ^fc-ii + caco, 



T x ' " w AT 

fc=0 fc=0 i=l 



G 



where 

T-i JV 



CiCO = ^ E X)tr{P(AW(fc),BW(fc))}, 

fc=0 i=l 

and 

T-l JV 

Ca(r) E E( AW (fc)x(fc) + B« (ifc)«W(*)) T P(AW (jfe), flW (A))(AW (fc)a;(fc) + B« (Jfe)v«(fc)) 



fc=0 i=l 

- (Ax(fc) +Bu(fc)) T P(A«(fc), J BW(fc))(Ax(fc) +B?i(fc)). 

(12) 

Subtracting ^Y^Y^iH^ + l) T P(A^(k+l),B^(k + l))x(k+l) \ F k -i), from both sides 
of ([TT|) while adding and subtracting ^ J2k=o u(k) T Ru(k) from right-hand side of (TTT1) . we get 



1 T_1 

- E [x(k) T Qx(k) + u(k) T Ru(k)} + Q(T) + Cs(T) + C(T) = Ci(T) + C 8 (T), (13) 



where 



fc=0 



T-l TV 

&( T ) = E E^( fc ) Tp ( A(l) ( fc )' s(,) ( fc )) a; ( fc ) 

fe=0 i=l 



-E{i(Hl) T P(i (i) (Hl),B (,) (fc + l)WHl) | J-fc-i}, 

T-l JV 

^cn = Nf E E uW w TiiuW ( fc ) - «(*) T fl«(*). 

fe=o t=i 

and 

T-l JV 

Cs(r) =— E E E ^( fc + i) T [P(^ w (fc), - P(^ (4) (fc + + !))]»(* + 1) I ^-i}. 



fc=0 i=l 



In the rest of the proof, we study the asymptotic behavior of the sequences {Q(k)}^L for all 
1 < t < 5. 

• Asymptotic behavior of Ci(^) : First, note that 



hmsupCi(T) - hmsup-i- E E tr { P ( A( °( fc )' S(l) ( fc ))} 

JV T-l 

= - E sup - E tr{P(A« (fc), flW (*))}. 



1=1 fc=0 

Using ((SJ) inside the above identity, we get 

as 

limsupCi(T) < tr{P(A,B)}. 

T->oo 

• Asymptotic behavior of C3(^) : With a similar strategy as in case (B) in the proof of Theorem 6 
in [15] . we can prove that 



= limsup - E x(k) T P{A^{k),B^{k))x{k) 
T ^°° T fe=0 

- E{x(k + l) T P(A^(k + 1), B«(fc + + 1) | J- fc _i}. 

Hence, limsupj^^ ^(T) = 0. 



7 



• Asymptotic behavior of CiiT)'- In this case, we have 



T-l 



T-l 



fc=0 



| J] u«(fc) T i?u«(fc) - u(k) T Ru(k) <^J2 \u (i} {k) T Ru^(k) - u(fc) T i?u(/fc) 

fe=0 
T-l 

< _ \\K {i \k) T RK {l \k) - K{k) T RK{k)\\\\x{k)\\ 



k=0 



According to LemmaGUl we have \\K^ (k) T RK^(k)-K(k) T RK(k)\\ < \\R\\\\K^(k)-K{k)\\{\\K^(k)\\ 
||-K"(fc)||). Considering that L(-, •) is a continuous function of its arguments (see [2T]) and V is a com- 
pact set, we know that ||ifW(fc)|| and are uniformly bounded. Hence, \\K ^{k)\\ + \\K(k)\\ < 
M. Now, using Cauchy-Schwartz inequality 22J, we get 



T-l 



T-l 



T-l 



i uM{k) T Ru^( k ) - u(k) T Ru(k) < \\R\\M i £ \\K®{k) X(fc)|| 2 ( ^ E H 3 **)! 



fe=0 



k=0 \ k=0 

Let us introduce the notation K° = h(A, B). Note that, for all p > 0, we have 

T-l T-l 

- W K(%) (k) K°\\ 2 <p 2 + -E || Jf«(k) - K°\\\(\\K®(k) - K°\\ > p). 



T 



fe=0 



Again, considering the facts that L(-, •) is a continuous function of its arguments and V is a compact 
set, we know that \\K^ l \k) — K°\\ is uniformly bounded. Thus, using from Theorem 13.11 we 

can show that limsupy^^ A YlikZo ^\K^ l \k) — K°\\ < p 2 , for all p > 0. Since the choice of p was 
arbitrary, we get limsupy^^ 

T £fc=o ll-^ W ( fc ) " ^11 2 = 0. With a similar reasoning, we can also 
prove that lim sup^^ A X)fe=o 11^0) "^ll 2 = °- Therefore, considering that ||J^ l )(fc)- Jf (fc)|| 2 < 
||ifW(*r) - X°|| 2 + ||iT(fe) - -ftT°|| 2 , we have limsup^^ A ^fc=o II^O) - K (k)\\ 2 = 0. Hence, 
limsupy^QQ Ci{T) = due to the fact that limsup T _ j . 00 |jx(fc)|| 4 < oo according to ([9]). 

• Asymptotic behavior of C5(^) : With the same approach as in case (C) in the proof of Theorem 6 
in [T5], we can prove limsup T _ ) , 00 ^(T) = 0. 

• Asymptotic behavior of C2<D : Let us start with studying the asymptotic behavior of the sequence 
{&°(r)}?Lo where 



T-l 



<f«(T) = i J2x(k) T (A^ (k) + B^(k)K^(k)) T P(A^(k),B^(k))(A^(k) + B^(k)K^(k))x(k) 

- x{k) T (A + BK{k)) T P{A {i \k), B^(k)){A + BK{k))x{k). 



k=0 



(14) 



Using Lemma 13.21 we can upper bound each term as 

x(k) T (A w (fc) + B w (k)K W (fc)) T P(A w (fc), B w (fc))(A w (fc) + B w (A) JC (i) (fc))x(fc) 
- x(k) T (A + BK(k)) T P(A^{k),B^(k)){A + BK{k))x{k) 
< \\x(k)\\ P(A«(fc),B«(k)) [A^{k) + B {t \k)K {i \k)]-[A + BK{k)\ 

[A® (k) + B w (*)] + [A + BK(k)] 



(15) 



Considering again that L(-, •) and P(-, •) are continuous functions of their arguments (see [213) anc ^ 
V is a compact set, we know that 



P(A^(k),B^(k)) 
[A® (fc) + B« (&)#« (fc)] + [A + Bif (fc)] 



<Mi, 
< M 2 . 
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Using Cauchy-Schwartz inequality, we get the inequality in 
$\T) < M 1 M 2 -Y, Mk)\\ 2 [A {i \k)+B^{k)K {l \k)] - [A + BK{k)] 



T-l 



1/2 



T-l 



< M t M 2 - Mk)\\ 4 ) ^ E |[AW(A)+BW(*)iirW(fc)] - [A + BK(k)\ 



1/2 



fc=0 



(16) 



Now, note that 

[A® (k) + B« (k)K® (k)] -[A + BK(k)] 2 < [A« (fc) + flW (k)K® (A)] - [A + BK® (fc)] 

+ ||L4 + BA (l) (fc)] - L4 + BiT ]|| 2 
+ ||L4 + BA°]-[^ + BA(fc)]|| 2 



+ ||B|| 2 f || A«(fc) - A°|| 2 + ||iT(fc) - #"1 



< 



Hence, with similar argument as above, we can prove that limsup T „ ! . 0O Q* C^) = 0) an d a s a result 
limsupy^QQ C2CO = limsupy^^ j? J2iLi Q CO = 0. Now, we are ready to prove the statement of 
this theorem. From the asymptotic behavior of sequences £i (T) and £3 (T) , we know that 

as 

tv{P(A,B)} > hmsupCi(T) + ( 3 (T). (17) 

T^oo 

Using identity (fT3|) inside inequality (fT7|) shows that 

tr{P(A,B)} > limsupC 4 (T) + 0s(T) + C 2 (T) + limsup - }J [x(k) T Qx{k) + u{k) T Ru(k)}, 
which result in 

tr{P(A, B)} > limsup - ^ [x{k) T Qx{k) + u(k) T Ru(k)} 



T-s-oo 



T->oo 



fc=0 



T-l 



T->oo 



fc=0 



This inequality finishes the proof. ■ 
Now, we are ready to present the solution of problem 

COROLLARY 3.4 For any plant graph G-p and design graph Gc, we get r^ c (T*) = 1, and r^ p (r*) = 
1. 

Proof: The proof is a direct consequence of Theorem 13.31 and Lemma 12.11 ■ 

Corollary 13.41 shows that, irrespective of the plant graph G-p and design graph Gc, there exists 
a limited model information control design strategy that can achieve a competitive ratio equal one. 
This control design strategy gives adaptive controllers achieving asymptotically the closed-loop per- 
formance of optimal control design strategy with full model information. Note that earlier results 
stated that such competitive ratio cannot be achieved by static or linear time-invariant dynamic 
controllers [5HT4"]. 



4 Example 



As a simple numerical example, let us consider the problem of regulating the distance between N 
vehicles in a platoon. We model vehicle i, 1 < i < N, as 



Xi(k + 1) 
Vi(k + 1) 



I + AT 



1 

-ai/nii 



Xi(k) 
Vi(k) 



+ 





ATft/m 



Ui(k) 



w\(k) 

4(k) 



»>9 
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where Xi(k) is the vehicle position, Vi(k) its velocity, rrii the mass, a>i the viscous drag coefficient, 
Pi the power conversion quality coefficient, and AT the sampling time. For each vehicle, stochastic 
exogenous inputs wUk) € R™, j — 1,2, capture the effect of wind, road quality, friction, etc. For 
simplicity of presentation, let us consider the case of N — 2 vehicles. In addition, assume that 
AT = 1. As performance objective, the designer wants to minimize the cost function 



T-l 



J = limsup- V [q d (xi(k) - x 2 (k) - d*) 2 + V q v {v t {k) ~ v*) 2 + r(ui{k) - u*) 2 ] , 



to regulate the distance between the trucks with minimum control effort. Note that u* — a.iV* /fii 
is the average control signal. We can write the reduced-order system using the distance between 
vehicles and their velocities as state variables in the form 



z(k + 1) = Az(k) + Bu(k) + w(k), z(0) = 0, 



(18) 



where 



and 



z(k) = [v\{k) — v* , xi(k) — X2(k) — d* , 

u{k) = [ui(k) - u* , u 2 {k) - U2} 1 , 

w(k) = [w\{k) , wl(k)+w 2 (k) , w 2 (k)] T 



A 



v 2 (k) 



_ Q l 







r 


" 


mi 






mi 




1 


1 -1 


, B = 











1 - 







h_ 




ma - 




m 2 - 



This model leads to 



T-l 



J = limsup — ^ z(k) T Qz(k) + u(k) T Ru(k), 



(19) 



where Q — diag(g 1J , qa, q v ) and i? = diag(r, r). To simplify the presentation, let Q = I and R = I. 

Note that z(0) = in ([T8"]) indicates that the vehicles start from desired distance d* of each 
other and with velocity v*. However, due to the exogenous inputs w(k), the vehicles will drift away 
from this ideal situation. By minimizing the closed- loop performance criterion in (|19|) . the designer 
minimizes this drift using the least control effort possible. 

We define the first subsystem as z_i{k) = z\{k) and the second subsystem as z 2 (fc) = i z 2(k) z^(k)] T . 
Therefore, we get 

Zx(k + 1) = a 11 z_ 1 (k) + b 11 u 1 (k) + wi(k), 



and 



z 2 (k + l) = 



1 -1 
a 2 2 



z 2 (k) + b 2 2U2(k) 



w 2 (k) 
w 3 (k) 



where (an, bn) are local parameters of subsystem i. Assume that 



A 



AG 



t 3x3 



A 



an 
1 1 -1 
a 22 



,an,a 22 G [0, 1] 



B 



B G 



0.3x2 



[0.5,1.5] 



611 
B = 
622 

Note that we can verify Assumptions 12.11 and 12.21 using the observability and controllability of struc- 
tured system, see |23U24j . 

We compare the performance of the introduced adaptive controller with a deadbeat control design 
strategy T A : V — > R 2x3 for this special family of systems as 



T A (P) 



-a n /bu 
I/622 





I/622 





-(1 + a 22 )/b 22 
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Figure 1: The running cost of the closed-system with different controllers. 
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Figure 2: Estimation error of model parameters. 

for all P — (A, B) g V . Note that T A is a limited model information control design strategy, because 
each local controller i is based on only parameters of subsystem i, i = 1,2. 

Figure [T] illustrates the running cost of the closed-system with the optimal control design with 
full model information K*(P) (solid curve), the modified Campi-Kumar adaptive controller T*(P) 
(dashed curve), and the deadbeat control design strategy T A (P) (dotted curve). The running costs of 
the closed-system with T*(P) and the optimal control design with full model information K*(P) both 
converge to tr{P(A, B)} (the horizontal line) as time goes to infinity. The cost of the optimal control 
design strategy with global model knowledge is always lower than the cost of the adaptive controllers. 
The simulation is done for parameters (a n ,bu) = (0.4360, 1.0497) and (022,622) = (0.0259,0.9353). 
Figure ?? illustrates the convergence of the individual model parameters (an, ha), i = 1,2, for the 
adaptive subcontrollers. Note that only one of the subsystems need to estimate each parameter (as 
each one has access to its own model parameters). 

5 Conclusion 

In this paper, as a generalization of earlier results in optimal control design with limited model 
information, we searched over the set of control design strategies that construct adaptive controllers. 
We found a minimizer of the competitive ratio both in average and supremum senses. We used 
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the Campi-Kumar adaptive algorithm to setup an adaptive control design strategy that achieves a 
competitive ratio equal to one, that is, this adaptive controller asymptotically achieves closed-loop 
performance equal to the optimal centralized controller with full model information. Wc illustrated 
the applicability of this adaptive controller on a vehicle platooning problem. As a future work, we 
suggest studying decentralized adaptive controllers. 
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A Proof of Lemma 12.11 

Let us assume, without loss of generality, that rp P (r) < oo since otherwise, the desired inequality 
is trivially satisfied. First, note that using Theorem 2.10.1 in [35], function Jp(T(P)) / Jp(K* (P)) is 
integrable on V since we assumed r™ p (r) = ess sup J p(I '(P)) / J p(K* (P)) < oo (and V is a compact 
set thanks to Assumption ^. 1[) . Then, using Theorem 2.7.1 in [22], we get 

rr(r) = S v MK* P (P)) f{P) dP ~ S/v V ^)f{P) dP = r s ^(T). 
This completes the proof. 



B Proof of Lemma 13.11 

Equations ©-([7]) are direct consequences of Theorems 2 and 3 in [15]. We start with proving ©. 
To do so, let us prove ||i^(fc) — L(A, B)\\ > p implies that there exists at least an index i such that 
\\TiK^{k)-T-L{A,B)\\ > p/y/N. We can prove this fact by contradiction. Assume that there does 
not exists any index i such that \\T t K^\k) - T l L(A,B)|| > p/y/N. Therefore, for all 1 < i < N, 
we have \\TiK^ (k) — Ti~L(A, P>)\\ < p/\/N, and as a result, according to Theorem 1 in [25], we get 

N 

\\K(k)-L(A,B)\\ 2 <Y,\\^K {l) (k)-TMAB)\\ 2 <p 2 . 

i=l 

This is contradictory to the assumption that (fc) — L(A, B)\\ > p. Hence, we proved the implica- 
tion. Based on this property, it is easy to see that 

k k N 

^2x(\\K(k)-L(A,B)\\ >p)<^^ X (||T^W(fc)-T i L(AP)||>p/V]V). (20) 
t=o t=0 i=l 

Now, note that \\T t K^ (k) -T t L(A B)\\ > p/VN implies that \\K^(k)-L{A,B)\\ > p/y/N. Thus, 
we get 

k k 

J2x(\\TiK^(k)-TMA,B)\\ > p/VN) <J2 X (\\K^(k) -L(A,B)\\ > p/y/N). (21) 
t=o t=o 
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Substituting inside ([20]). we get 

fe fc jV 

£ xdl^(fc) - HA B)\\> P )<J2J2 (*) - HA B)\\ > p/Vn). 

t=0 t—o i=l 

Now, using ([7]), we can show that 

k 

Y^xiWK^ik) - HAB)\\ > p/Vn) = o(p(k)), 

t=o 

for all 1 < i < N. Therefore, we have 

J2x(\\K(k)~L(A,B)\\ > P ) a dO{^k)). 
t=o 

Finally, note that the proof of (O is a direct result of applying © to the proof of Theorem 5 in [TS] . 
This concludes the proof. 

C Proof of Lemma 13.2 

First, note that 

(X - Y) T P(X + Y) + (X+Y) T P(X -Y) = 2(X T PX - Y T PY). 

Hence, we get 

2\\(X T PX - Y T PY)\\ = \\{X -Y) T P(X + Y) + (X + Y) T P(X -Y)\\ 

< \\{X -Y) T P(X + Y)\\ + \\(X + Y) T P(X -Y)\\ 

< 2||p||||x-r||||x + y|| 
<2||p||||x-y||(||x|| + ||y||). 

This concludes the proof. 
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